(57) Table data (201) is stored by parsing the table
data into columns (221, 215, 209) of values, formatting
each column into a data stream, and transferring each
data stream to a storage device for storage as a continuous
strip of data. The strip of data is stored as a file
(223, 225; 217, 219; 211, 213) that is not structured as
a series of pages. The formatting of the data stream may
include compressing the column values to minimize the
length of the data strip. A particular compression procedure
may be used that derives a code for each value in
the column from a number of occurrences of the value
in the column and replaces the value in the data stream
with the corresponding code.
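
The column-wise storage and occurrence-based coding described in the abstract can be illustrated with a minimal sketch. The abstract does not say how a code is derived from the occurrence counts, so the rank-by-frequency scheme below (the most frequent value receives the smallest integer code) is an assumption, and all function names are hypothetical.

```python
from collections import Counter


def parse_columns(rows):
    """Transpose a list of equal-length rows into a list of columns."""
    return [list(col) for col in zip(*rows)]


def frequency_codes(column):
    """Map each distinct value to an integer code based on its count.

    Assumption: more frequent values get smaller codes; ties are broken
    by first appearance in the column.
    """
    counts = Counter(column)
    first_seen = {}
    for index, value in enumerate(column):
        first_seen.setdefault(value, index)
    ordered = sorted(counts, key=lambda v: (-counts[v], first_seen[v]))
    return {value: code for code, value in enumerate(ordered)}


def encode_column(column):
    """Replace every value in the column with its code."""
    codes = frequency_codes(column)
    return codes, [codes[value] for value in column]


def decode_column(codes, stream):
    """Invert encode_column using the same code table."""
    inverse = {code: value for value, code in codes.items()}
    return [inverse[code] for code in stream]


def to_strip(stream):
    """Pack the codes into one contiguous byte string (no page structure).

    Assumption: fewer than 256 distinct values, so one byte per code.
    """
    return bytes(stream)


rows = [("red", 1), ("blue", 1), ("red", 2), ("red", 1)]
columns = parse_columns(rows)
codes, stream = encode_column(columns[0])
strip = to_strip(stream)
```

A variable-length scheme such as Huffman coding would also derive codes from occurrence counts and would shorten the strip further; the fixed one-byte packing here is only the simplest choice.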
In this paper a new method to improve the utilization of main memory systems is
presented. The new method is based on prestoring in main memory a number of query
answers, each evaluated out of a single memory page. To this end, the ideas of page-answers
and page-traces are formally described and their properties analyzed. The query
model used here allows for selection, projection, join, recursive queries as well as 
arbitrary combinations. We also show how to apply the approach under update ... 
For many applications, it is important to quickly locate the nearest neighbor of a given
time series. When the given time series is a streaming one, nearest neighbors may need
to be found continuously at all time positions. Such a standing request is called a
continuous nearest neighbor query. This paper seeks fast evaluation of continuous queries
on large databases. The initial strategy is to use the result of one evaluation to restrict
the search space for the next. A more fundamental i ...

Keywords: continuous query, nearest neighbor, streaming time series 
A quantitative analysis of a large collection of expert-rated web sites reveals that page-level
metrics can accurately predict if a site will be highly rated. The analysis also provides
empirical evidence that important metrics, including page composition, page formatting, 
and overall page characteristics, differ among web site categories such as education, 
community, living, and finance. These results provide an empirical foundation for web site 
design guidelines and also suggest which me ... 

Keywords: Web site design, World Wide Web, automated usability evaluation, empirical 
studies 
Full text available: pdf Additional Information: full citation, abstract, references, citings, index
In this paper, we study the scheduling and optimization problems of parallel query
processing using interoperation parallelism in a shared-memory environment and propose 
our solutions for XPRS. We first study the scheduling problem of a set of a continuous 
sequence of independent tasks that are either from a bushy tree plan of a single query or 
from the plans of multiple queries, and present a clean and simple scheduling algorithm. 
Our scheduling algorithm achieves maximum resource utilizat ... 

10 Improving statistical language model performance with automatically generated word hierarchies

John G. McMahon, Francis J. Smith 
Publisher: MIT Press

Full text available: Publisher Site

An automatic word-classification system has been designed that uses word unigram and 
bigram frequency statistics to implement a binary top-down form of word clustering and 
employs an average class mutual information metric. Words are represented as structural 
tags, n-bit numbers the most significant bit-patterns of which incorporate class
information. The classification system has revealed some of the lexical structure of 
English, as well as some phonemic and semantic structure. The syst ... 
Bruce Wilcox

October 1985 ACM SIGART Bulletin, issue 94 
Publisher: ACM Press 

Full text available: pdf Additional Information: full citation, abstract, references

From 1972 to 1979 I co-designed and built what became the world's strongest computer 
Go program, the Reitman-Wilcox Go Program [1]. It took 7 person-years, 8K lines of 
LISP, 3 megabytes of memory, and an IBM mainframe. Recently I constructed a similar 
program, called NEMESIS...the Go Master (tm). It has taken 1 person-year, 13.5K lines of
C, 146 kilobytes of memory, and an IBM-PC. They play at a similar strength and by 
similar means. This article discusses both how I went about reengineering the ... 
A long-standing research problem in computer graphics is to reproduce the visual
experience of walking through a large photorealistic environment interactively. On one
hand, traditional geometry-based rendering systems fall short of simulating the visual
realism of a complex environment. On the other hand, image-based rendering systems
have to date been unable to capture and store a sampled representation of a large
environment with complex lighting and visibility effects. In this paper, we prese ... 

Keywords: capture, image-based rendering, interactive, reconstruction, walkthrough 
In many applications, local or remote sensors send in streams of data, and the system
needs to monitor the streams to discover relevant events/patterns and deliver instant 
reaction correspondingly. An important scenario is that the incoming stream is a 
continually appended time series, and the patterns are time series in a database. At each 
time when a new value arrives (called a time position), the system needs to find, from the 
database, the nearest or near neighbors of the incoming time serie ... 
Full text available: pdf (967.44 KB) Additional Information: full citation, abstract, references, index terms

In this paper, we address the problem of clustering graphs in object-oriented databases. 
Unlike previous studies which focused only on a workload consisting of a single operation, 
this study tackles the problem when the workload is a set of operations (method and 
queries) that occur with a certain probability. Thus, the goal is to minimize the expected 
cost of an operation in the workload, while maintaining a similarly low cost for each 
individual operation class. To this end, we ...

Keywords: graph partitioning, object-oriented database systems, performance analysis, 
storage techniques 
This paper develops a methodology for compiling and executing irregular parallel
programs. Such programs implement parallel operations whose size and work distribution 
depend on input data. We show a fundamental relationship between three quantities that 
characterize an irregular parallel computation: the total available parallelism, the optimal 
grain size, and the statistical variance of execution times for individual tasks. This 
relationship yields a dynamic scheduling al ... 
This paper describes an implemented robotic agent architecture in which the environment,
as sensed by the agent, is used to guide the recognition of spoken and gestural directives
given by a human user. The agent recognizes these directives using a probabilistic
language model that conditions probability estimates for possible directives on visually-,
proprioceptively-, or otherwise-sensed properties of entities in its environment, and
updates these probabilities when these properties change. Th ... 

Keywords: language modeling, multi-modal interfaces, robotics, sensor fusion, spoken 
language interfaces 
Sophisticated disk scheduling algorithms require accurate, detailed disk drive
specifications, including data about mechanical delays, on-board caching and prefetching 
algorithms, command and protocol overheads, and logical-to-physical block mappings. 
Comprehensive disk models used in storage subsystem design require similar levels of 
detail. We describe a suite of general-purpose algorithms and techniques for acquiring the 
necessary information from a SCSI disk drive. Using only the ANSI-standa ... 

19 The Graft-Host method for design change

Guillermo Arango, Eric Schoen, Robert Pettengill, Josiah Hoskins 

May 1993 Proceedings of the 15th international conference on Software Engineering 
Publisher: IEEE Computer Society Press 

Full text available: pdf (1.11 MB) Additional Information: full citation, references, citings



http://portal.acm.org/results.cfm?coll=ACM&dl=ACM&CFID=60179317&CFTOKEN=42... 12/15/05



Results (page 1): compress$6 and continuous and ((data adj page) or (series near pages!) ... Page 6 of 6 




20 Limitations of cache prefetching on a bus-based multiprocessor

Dean M. Tullsen, Susan J. Eggers 

May 1993 ACM SIGARCH Computer Architecture News, Proceedings of the 20th annual international symposium on Computer architecture ISCA '93, Volume 21 Issue 2

Publisher: ACM Press 

Full text available: pdf (1.10 MB) Additional Information: full citation, abstract, references, citings, index terms

Compiler-directed cache prefetching has the potential to hide much of the high memory 
latency seen by current and future high-performance processors. However, prefetching is 
not without costs, particularly on a multiprocessor. Prefetching can negatively affect bus 
utilization, overall cache miss rates, memory latencies and data sharing. We simulated 
the effects of a particular compiler-directed prefetching algorithm, running on a bus-based 
multiprocessor. We showed that, despite a high mem ...
Research aimed at correcting words in text has focused on three progressively more
difficult problems: (1) nonword error detection; (2) isolated-word error correction; and (3)
context-dependent word correction. In response to the first problem, efficient pattern-matching
and n-gram analysis techniques have been developed for detecting strings that
do not appear in a given word list. In response to the second problem, a variety of
general and application-specific spelling cor ...

Keywords: n-gram analysis, Optical Character Recognition (OCR), context-dependent 
spelling correction, grammar checking, natural-language-processing models, neural net 
classifiers, spell checking, spelling error detection, spelling error patterns, statistical- 
language models, word recognition and correction 
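
The two nonword-detection approaches named in the abstract above (word-list lookup and n-gram analysis) can be sketched as follows. This is an illustrative assumption about how such checks are commonly built, not the method of the surveyed systems; the function names, the boundary markers `^` and `$`, and the choice of character trigrams are all hypothetical.

```python
def char_ngrams(word, n=3):
    """Return the set of character n-grams of a word, with boundary markers."""
    padded = f"^{word.lower()}$"
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def build_ngram_table(word_list, n=3):
    """Collect every n-gram that occurs in at least one listed word."""
    table = set()
    for word in word_list:
        table |= char_ngrams(word, n)
    return table


def is_nonword_by_lookup(token, lexicon):
    """Exact check: the token is a nonword if it is absent from the word list."""
    return token.lower() not in lexicon


def has_unseen_ngram(token, table, n=3):
    """Approximate check: flag the token if any of its n-grams was never seen."""
    return not char_ngrams(token, n) <= table


lexicon = {"the", "then", "hen"}
table = build_ngram_table(lexicon)
```

With this toy lexicon, `"teh"` is flagged by both checks, while `"he"` is flagged only by the lookup: every trigram of `"he"` also occurs in a listed word, which shows why the n-gram test is a cheaper but weaker filter than a full word-list lookup.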




8 ABCL/onEM-4: a new software/hardware architecture for object-oriented concurrent






Masahiro Yasugi, Satoshi Matsuoka, Akinori Yonezawa 

August 1992 Proceedings of the 6th international conference on Supercomputing 
Publisher: ACM Press 

Full text available: pdf (1.39 MB) Additional Information: full citation, abstract, references, citings, index terms

The trend towards object-oriented software construction is becoming more and more 
prevalent, and parallel programming cannot be an exception. In the context of parallel 
computation, it is often natural to model the computation as message passing between 
autonomous, concurrently active objects. The problem was, as some previous studies had 
indicated, that the overhead from message reception to dynamic method dispatching 
consumes a significant amount of execution time (e.g., as much as 4000 m ... 

9 Special issue on kn



Ronald J. Brachman, Brian C. Smith

February 1980 ACM SIGART Bulletin, issue 70 

Publisher: ACM Press 

Full text available: pdf Additional Information: full citation, abstract

In the fall of 1978 we decided to produce a special issue of the SIGART Newsletter 
devoted to a survey of current knowledge representation research. We felt that there 
were two useful functions such an issue could serve. First, we hoped to elicit a clear
picture of how people working in this subdiscipline understand knowledge representation 
research, to illuminate the issues on which current research is focused, and to catalogue 
what approaches and techniques are currently being developed. Secon ... 

10 Unification encoding

Stephen G. Pulman

September 1996 Computational Linguistics, volume 22 issue 3 

Publisher: MIT Press 
Full text available: pdf Additional Information: full citation, abstract, references, citings
Publisher Site

This paper describes various techniques for enriching unification-based grammatical 
formalisms with notational devices that are compiled into categories and rules of a 
standard unification grammar. This enables grammarians to avail themselves of 
apparently richer notations that allow for the succinct and relatively elegant expression of 
grammatical facts, while still allowing for efficient processing for the analysis or synthesis 
of sentences using such grammars. 

11 Special issue: AI in engineering

D. Sriram, R. Joobbani 
April 1985 ACM SIGART Bulletin, issue 92 

Publisher: ACM Press 

Full text available: pdf (8.79 MB) Additional Information: full citation, abstract




The papers in this special issue were compiled from responses to the announcement in 
the July 1984 issue of the SIGART newsletter and notices posted over the ARPAnet. The 
interest being shown in this area is reflected in the sixty papers received from over six 
countries. About half the papers were received over the computer network. 

12 Special issue on word sense disambiguation: Introduction to the special issue on word sense disambiguation: the state of the art

Nancy Ide, Jean Véronis

March 1998 Computational Linguistics, volume 24 issue 1 






Publisher: MIT Press 
Full text available: pdf Additional Information: full citation, references, citings
Publisher Site



13 Computing curricula 2001 



September 2001 Journal on Educational Resources in Computing (JERIC)

Publisher: ACM Press 

Full text available: pdf (613.63 KB), html (2.78 KB) Additional Information: full citation, references, citings, index terms



14 Special section: Reasoning about structure, behavior and function

B. Chandrasekaran, Rob Milne 
July 1985 ACM SIGART Bulletin, issue 93 

Publisher: ACM Press 

Full text available: pdf (5.13 MB) Additional Information: full citation, abstract, references




The last several years of work in the area of knowledge-based systems has resulted in a
deeper understanding of the potentials of the current generation of ideas, but more 
importantly, also about their limitations and the need for research both in a broader 
framework as well as in new directions. The following ideas seem to us to be worthy of 
note in this connection. 

15 Revised Report of the Algorithmic

A. van Wijngaarden 

August 1981 ALGOL Bulletin, issue sup 47 
Publisher: Computer History Museum 

Full text available: pdf Additional Information: full citation, index terms



16 Fast detection

Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies on Collaborative research
Publisher: IBM Press 

Full text available: pdf (4.21 MB) Additional Information: full citation, abstract, references, index terms

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the 
execution of the application. The visualization tool we use is Poet, an event tracer 
developed at the University of Waterloo. However, these diagrams are often very complex 
and do not provide the user with the desired overview of the application. In our 
experience, such tools display repeated occurrences of non-trivial commun ... 

17 The FINITE STRING News
Computational Linguistics Staff 

January 1987 Computational Linguistics, Volume 13 issue 1-2 

Publisher: MIT Press 

Full text available: pdf Additional Information: full citation
Publisher Site






18 Text categorization

Paolo Frasconi, Giovanni Soda, Alessandro Vullo

January 2001 Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Publisher: ACM Press 

Full text available: pdf (280.05 KB) Additional Information: full citation, abstract, references, index terms

Text categorization is typically formulated as a concept learning problem where each
instance is a single isolated document. In this paper we are interested in a more general 
formulation where documents are organized as page sequences, as naturally occurring in 
digital libraries of scanned books and magazines. We describe a method for classifying 
pages of sequential OCR text documents into one of several assigned categories and 
suggest that taking into account contextual information provid ... 

Keywords: hidden Markov models, multi-page documents, naive Bayes classifier, text 
categorization 



19 Data compression

Debra A. Lelewer, Daniel S. Hirschberg 

September 1987 ACM Computing Surveys (CSUR), Volume 19 Issue 3 
Publisher: ACM Press 

Full text available: pdf (3.61 MB) Additional Information: full citation, abstract, references, citings, index terms, review

This paper surveys a variety of data compression methods spanning almost 40 years of 
research, from the work of Shannon, Fano, and Huffman in the late 1940s to a technique 